Korpus: ara-ir_newscrawl-OSIAN_2018_100K

Weitere Korpora

5.1.3 Number of words without NN co-occurrences

ordered by frequency

Words without right neighbor
frequency # words percentage
1 108753 100.00
2 28461 99.99
3 13042 96.01
4 7492 91.88
5 4925 87.21
6 3576 83.75
7 2612 78.44
8 1881 74.70
9 1527 70.34
10 1232 67.18
11 954 61.55
12 790 58.00
13 663 53.81
14 519 51.54
15 432 46.55
16 372 43.51
17 340 44.50
18 260 39.16
19 237 40.44
20 195 32.50
21 201 34.48
22 179 35.94
23 147 31.68
24 117 28.33
25 133 33.00
26 93 26.72
27 88 25.36
28 85 24.43
29 72 24.24
30 56 18.54
31 55 20.22
32 55 21.15
33 44 16.48
34 35 14.58
35 27 14.06
36 37 16.37
37 22 10.28
38 34 17.17
39 23 13.53
40 19 11.59
41 21 12.35
42 19 11.95
43 14 9.15
44 12 7.64
45 12 7.74
46 8 5.88
47 10 7.81
48 12 7.79
49 11 8.09
50 5 4.59
Words without left neighbor
frequency # words percentage
1 108753 100.00
2 28461 99.99
3 12955 95.37
4 7394 90.68
5 4869 86.22
6 3482 81.55
7 2528 75.92
8 1791 71.13
9 1455 67.02
10 1149 62.65
11 879 56.71
12 759 55.73
13 651 52.84
14 481 47.77
15 400 43.10
16 367 42.92
17 296 38.74
18 246 37.05
19 202 34.47
20 215 35.83
21 172 29.50
22 164 32.93
23 136 29.31
24 103 24.94
25 100 24.81
26 69 19.83
27 84 24.21
28 80 22.99
29 50 16.84
30 70 23.18
31 65 23.90
32 39 15.00
33 53 19.85
34 47 19.58
35 36 18.75
36 44 19.47
37 24 11.21
38 27 13.64
39 30 17.65
40 24 14.63
41 24 14.12
42 25 15.72
43 20 13.07
44 20 12.74
45 16 10.32
46 14 10.29
47 12 9.38
48 15 9.74
49 16 11.76
50 9 8.26
1492 msec needed at 2022-07-05 23:03